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Window shaping ftinctions for watermarking of multimedia signals 



EPO - DG 1 
2 8. 03. 2002 



The present invention relates to window shaping functions, and the use of such 
functions in apparatus and methods for encoding and decoding information in multimedia 
signals, such as audio, video or data signals. 

5 Watermarking of multimedia signals is a technique for the transmission of 

additional data along with the multhnedia signal. For mstance. watermarking techniques can 
be used to embed copyright and copy control information into audio signals. 

The main requirement of a watermarking scheme is that it is not observable 
(i.e. in the case of an audio signal, it is inaudible) whUst being robust to attacks to remove the 
10 watemiark ftom the signal (e.g. removing the watemiark wUl damage the signal). It will be 
appreciated that the robustness of a watermark will normally be a trade off against the quaUty 
of the signal m which the watermark is embedded. For instance, if a watermark is strongly 
embedded into an audio signal (and is thus difficult to remove) then it is Ukely that the 
quaHty of the audio signal will be reduced. 
15 Various types of audio watermarkmg schemes have been proposed, each with 

its ovm advantages and disadvantages. For instance, one type of audio watermarking scheme 
is to use temporal correlation techniques to embed the desired data (e.g. copyright 
mfomxation) into the audio signal. This technique is effectively an echo-hiding algorithm, in 
which the strength of echo is determined by solving a quadratic equation. The quadratic 
20 equation is generated by auto-correlation values at two positions: one at delay equal to x. and 
one at delay equal to 0. At the detector, the watermark is extracted by determining the ratio of 
the auto correlation function at the two delay positions. 

WO 00/00969 describes an alternative technique for embedding or encoding 
25 auxiUary signals (such as copyright information) into a multimedia host or cover signal. A 
leplica of the cover signal, or a portion of the cover signal in a particular domain (time, 
ftequency or space), is generated according to a stego key, which specifies modification 
values to the parameters of the cover signal. Ihe replica signal is then modified by an 
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auxiliary signal corresponding to the information to be embedded, and inserted back into the 
cover signal so as to form the stego signal. 

At the decoder, in order to extract the original auxiliary data, a replica of the 
stego signal is generated in the same manner as the replica of the original cover signal, and 
5 requires the use of the same stego key. The restilting replica is then correlated with the 
received stego signal so as to extract the auxiliary signal. 

In such watemaarking schemes the additional data to be embedded within the 
multimedia signal typically takes the form of a sequence of values. This sequence of values is 
then converted into a slowly varying narrow-band signal by applying a window shaping 
10 function to each value. To date, only bell shaped window shaping functions such as raised 
cosine functions (e.g. the Hanning window function shown in Fig. 1) have been utilized. 

It is an object of the present invention to provide an alternative window 
shaping function that allows improved performance over prior art window shaping functions. 

15 In a first aspect, the present invention provides a method of generating a 

watermark signal for embedding in a multimedia host signal, the method comprising the steps 
of: taldng a first sequence of values; applying a window shaping function to said sequence of 
values so as to form a smoothly varying signal suitable for embedding in the host signal; 
wherein the integral over the window shaping function is zero. 

20 Preferably, said window shaping function has an anti-symmetric temporal 

behavior. 

Preferably, said window shaping function has a bi-phase behavior. 
Preferably, the bi-phase window comprises at least two Harming windows of 
opposite polarities. 

25 Preferably, the frequency spectrum of the smoothly varying signal has a DC 

component less than a component of any non-DC peak within the fi-equency spectrum. 

Preferably, each value of the first sequence is represented by a pulse train of 
width Ts so as to form a rectangular wave signal, the window shaping function also being of 
width Ts. 

30 Preferably, said first sequence of values is convolved with the window shaping 

function so as to form said smoothly varying signal. 

Preferably, the method further comprises the step of embedding said smootihily 
varying signal into the host signal. 
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In a further aspect, the present invention provides an apparatus arranged to 
generate a watermark signal suitable for embedding in a host multimedia signal, the 
apparatus comprising: 

a) a signal generator arranged to generate a watermark signal by taking a jfirst 
5 sequence of values; and 

b) processing means arranged to apply a window shaping function to said 
sequence of values so as to form a smoothly varying signal suitable for embedding in a host 
signal; wherein the integral over the window shaping function is zero. 

Preferably, the apparatus further comprises a watermark embedding apparatus 
1 0 that embeds said smoothly varying signal into the host signal. 

In another aspect, the present invention provides a multimedia signal 
comprising a watermark, wherein the original multimedia signal has been watermarked by a 
smoothly varying signal formed by applying a window shaping function to a sequence of 
values, the integral over the window shaping function being zero, 
1 5 Preferably, the temporal envelope of the original signal has been modified by 

the watermark. 

In a further aspect, the present invention provides a method of detecting a 
watermark signal embedded in a multimedia signal, the method comprising the steps of: 

receiving a multimedia signal that may potentially be watermarked by a 
20 watermark signal modifying the host multimedia signal; 

extracting an estimate of the watermark firom said received signal by assuming 
that the watermark comprises a sequence of values to which a window shaping function has 
been applied, the integral over the window shaping function being zero; and 

processing the estimate of the watermark with a referenced version of the 
25 watermark so as to determine whether the received signal is watermarked. 

Preferably, the method ftirther comprises the step of applying a window 
shaping function to said received signal, the integral over the window shaping function being 
zero. 

Pieferably, the watermark signal has a payload, and the method further 
30 comprises the step of determining the payload of the watermark. 

In another aspect, the present invention provides a watermark detector 
apparatus arranged to detect whether a watermark signal is embedded within a multimedia 
signal, the watermark detector comprising: 
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a receiver arranged to receive a multimedia signal that may potentially be 
watermarked by a watermark signal modifying the host multimedia signal; 

an extractor arranged to extract an estimate of the watermark from said 
received signal by assuming that the watermark comprises a sequence of values to which a 
5 window shaping function has been applied, the integral over the window shaping function 
being zero; and 

a processor arranged to process the estimate of the watermark vrith a 
referenced version of the watermark so as to determine whether the received signal is 
watermarked. 

10 Preferably, the apparatus further comprises a unit arranged to apply a window 

shaping function to the said received signal, wherein the integral over the window shaping 
function is zero. 

For a better understanding of the invention, and to show how embodiments of 
1 5 the same may be carried into effect, reference will now be made, by way of example, to the 
accompanying diagrammatic drawings in which: 

Figure 1 illustrates a Hanning window shaping function, as utilized in the prior 

art; 

Figure 2 illustrates a bi-phase window shaping function according to a 
20 preferred embodiment of the present invention, in which the shapes of the two lobes are 
Hanning window functions; 

Figure 3 illustrates frequency spectra for a sequence Wdi[k]={ 1,1,-1,1,-1,-1} 
conditioned with respectively a Haiming window shaping function and a bi-phase window 
shaping function; 

25 Figure 4 illustrates a sequence Wi formed by conditioning the sequence Wdi 

with the bi-phase window shaping function shown in Fig. 2, and a running integral of Wi (J 
Wi); 

Figure 5 illustrates a sequence Wi formed by conditioning tiie sequence Wdi 
with the Hanning window shaping function, and a running integral of Wi (f wO; 
30 Figure 6 is a diagram illustrating a watermark embedding apparatus in 

accordance with an embodiment of the present invention; 

Figure 7 shows a signal portion extraction filter if used in one preferred 

embodiment; 
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Figures 8a and 8b show respectively the typical amplitude and phase responses 
of the filter H shown in Fig. 7 as functions of firequency; 

Figure 9 shows the payload embedding and watennark conditiomng stage; 

Figure 10 is a diagram illustrating the details of one possible implementation 
of the watermark conditioning apparatus He of Fig. 9, mcluding charte of the associated 

signals at each stage; 

Figure 11 isadiagram illustrating a watermark detector in accordance with an 

embodiment of the present invention; 

Figure 12 diagrammaticaUy shows the whitening filter H.of Fig. U, for use in 

conjunction with a bi-phase window shaping function; 

Figure 13 shows a typical shape of the correlation function output from the 
correlator of the watemiark detector shown m Fig. 11 ; and 

Figure 14 Ulustrates a further window shapmg function according to an 
alternative embodiment of the present mvention. 

Fig. 2 shows a wmdow shying function as a function of time according to a 
preferred embodiment of the present invention. The integral over the window shaping 
function is zero i.e. the total positive area of tiie function is equal to the total negative area 
(such tiiat tiie average area is zero). The wmdow shaping function is a bi-phase function witix 
anti-symmetdctemporalbehavior. where eachlobe of the window fimction is aHanmng 
window function. 

Use of tUs window shaping function wiftinwatennaridng schemes has been 
shown to offer imprnved per&nnance con^a^d wift Ihe use of the Hanning window 

shaping function shown in Fig. 1 . 

Fig. 3 illustrates the frequency spectra corresponding toawatermark sequence 

(wa[k]={l 1 -1.1,-1,-1}) conditioned withrespectivelyaHamiingandabi.phase window 
shal>ing fimction. As can be seen, tiie frequency spectirmn for the Hamiing window 
conditioned watermark sequence has a maximum at frequency ^, whilst the frequency 
spectrum for ihe bi-phase sh^ed watemiark sequence hasammHnumat^i.e. it has very 

) little DC component. 

m many instances, usefiil mformation is contained m tiie non-DC component 
of the watemiaik only. Consequentiy, for tiie same added watermark energy, a watermark 
conditioned witii tiie bi-phase wmdow will carry more useful infomiation tiian one 
conditioned by tiie Hanning window shapmg function. As a result, tiie bi-phase wmdow 



i'JHNL020241EPP. 



10 



30 



^ 26.03.2002 
offers superior audibility performance for the same robustness, or conversely, it allows a 
better robustness for the same audibility quality. 

Fig. 4 illustrates a normalized integral (shown as a dotted line) for the 
sequence wj,- conditioned with the bi-phase window shaping function shown in Fig. 2. 
Conversely, Fig. 5 shows a normalized integral for the same sequence conditioned with the 
Hanning window shaping function. It can be seen that the maximum value of the normalized 
integral is lower for the sequence conditioned by the bi-phase window function as compared 
to that for the sequence conditioned by the Hamiing window function. 

Use of this window shaping function wiU now be described in conjunction 
with a watermarking scheme. However, it will of course be appreciated that the appHcation of 
this window shaping function is not restricted to the below scheme, but could be applied Id 
other watermarking techniques, particularly time domain watennaridng techniques. It can 
also be used to cairy secret keys (e.g. cryptographic keys) that can be used for the re- 
generation of reference random sequences at the detector side, allowing the possibility of 
15 embedding different random sequences m different host signals. 

Fig. 6 shows a block diagram of the apparatus reqmred to perform the digital 
signal processing for embedding a multi-bit payload watermark Wc into a host signal x in 
accordance with a preferred embodiment to the present invention. 

A host signal x is provided at an input 12 of the ^paratus. The host signal x is 
passed in the dh-ection of output 14 via the adder 22. However, a replica of the host signal x 
(mput 8) is split off in the dnection of the multipUer 18, for carrying the watermark 
information. 

The watermark signal Wc is obtained from the payload embedder and 
watermark conditioning apparatus 6, and is derived from the watermark random sequence v.^, 
which is input to the payload embedder and watermark conditioning apparatus. The 
multiplier 18 is utilized to calculate the product of the watermark signal Wc and the replica 
audio signal x. The resulting product, w^x is then passed via a gain controller 24 to the adder 
22. The gain controller 24 is used to ampUfy or attenuate the signal by a gain factor a. 

The gain factor a controls the trade off between the audibility and the 
robustness of the watermark. It may be a constant, or variable in at least one of tune, 
frequency and space. The apparatus in Fig. 6 shows that, when a is variable, it can be 
automatically adapted via a signal analyzing unit 26 based upon the properties of the host 
signal X, Preferably, the gain a is automatically adapted, so as to minimize the impact on the 
signal quality, according to a properly chosen perceptibility cost-function, such as apsycho- 
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acoustic model of the human auditory system (HAS). Such a model is, for instance, described 
in the paper by E.Zwicker, "Audio Engineering and Psychoacoustics: Matching signals to the 
final receiver, the Human Auditory System", Journal of the Audio Engineering Society, Vol. 
39, pp. VoLl 15-126, March 1991. 

In the following, an audio watermark is utilized, by way of example only, to 
describe this embodiment of the present invention. 

The resxdting watermark audio signal y is then obtained at the ou^ut 14 of the 
embedding apparatus 10 by adding an appropriately scaled version of the product of Wq and x 
to the host signal: 

y\n\ = y\n\ + awc • (1) 



Preferably, tiie watermark Wc is chosen such that when mxiltiplied with x, it 
predominantly modifies the short time envelope of x. 

Fig. 7 shows one preferred embodiment in which the input 8 to the multiplier 
1 8 in Fig. 6 is obtained by filtering the replica of the host signal x using a filter /f in the 
1 5 filtering unit 1 5. If the filter output is denoted by Xh, then according to this preferred 
embodiment, the watermark signal is generated by adding the product of Xh and the 
watermark Wc to the host signed x. 

Let Xj, be defined such that x^ = x-x^ , and be defined such that 

y = + Xft , then the watermarked signal can be written as 

= (1+ WcW)^* W+^*M • (2) 



and the envelope modulated portion of the watermarked signal y is given as 

yM^a+wMMn] (3) 



Preferably, as shown in Fig. 8, the filter ffis a linear phase band-pass filter 
characterized by its lower cut off firequency fi and upper cut off frequency^. As can be seen 

25 in Fig. 8b, the filter H has a linear phase response with respect to firequency / within the pass 
band (BW). Thus, when is a band-pass filter, xi, and Xi, are the in-band and out-of-band 
components of the host signal respectively. For optimxmi performance, it is preferable that 
the signals Xd and Xi, are in phase. This is achieved by appropriately compensating for the 
phase distortion produced by filter H. In the case of a linear phase filter, the phase distortion 

30 is a simple delay. 



PHNL020241EI 

8 26.03,2002 
In Fig. 9, the details of the payload embedder and watermark conditioning unit 
6 is shown. In this unit the watermark seed signal Ws is converted into a multi-bit watermark 
signal Wc. 

Firstly a finite length, preferably zero mean and uniformly distributed random 
5 sequence Ws is generated using a random number generator with an initial seed 5. As will be 
appreciated later, it is preferable that this initial seed S is known to both the embedder and the 
detector, such that a copy of the watermark signal can be generated at the detector for 
comparison purposes. This results in the sequence of length 

wAk] e [-1,1], for k^0,l2, .... X^-i (4) 
Then the sequence ws is circularly shifted by the amounts dj and rf^ using the 

10 circularly shifting units 30 to obtain the random sequences Wdj and Wd2 respectively. It will be 
appreciated that these two sequences (wdi and Wd2) are effectively a first sequence and a 
second sequence, with the second sequence being circularly shifted with respect to the first 
Each sequence w^h i = 1>2, is subsequently multiplied with a respective sign bit in the 
multiplying unit 40, where ri = +1 or -1 , the respective values of ri and Vz remaining constant, 

15 and only changing when the payload of the watermark is changed. Each sequence is then 
converted into a slowly varying narrow-band signal Wi of length L^Ts by the watermark 
conditioning circuit 20 shown in Fig. 9. Finally, the slowly varying narrow-band signals Wj 
and W2 are added with a relative delay Tr (where Tr<T^ to give the multi-bit payload 
waterinark signal w^. This is achieved by jSrst delaying the signal W2 by the amoimt TV using 

20 delaying unit 45 and subsequently by adding it to wi with the adding unit 50. 

Fig. 10 shows one possible implementation of the watermark conditioning 
apparatus 20 used in the payload embedder and watermark conditioning apparatus 6 in more 
detail. The watermark random sequence Ws is input to the conditioning apparatus 20. 

For convenience, the modification of only one of the sequences w^// is shown 

25 in Fig. 1 0, but it will be appreciated that each of the sequences is modified in a similar 
manner, with the results being added to obtain the watermark signal Wc. 

As shown in Fig. 10, each watermark signal sequence Wdi[k], 1=^1,2 is applied 
to the input of a sample repeater 180. Chart 181 illustrates one of the possible sequences Wdi 
as a sequence of values of random numbers between +1 and —1, with the sequence being of 

30 length Ly^. The sample repeater repeats each value witbin the watermark random sequence Ts 
times, so as to generate a pulse train signal of rectangular shape. Ts is referred to as the 
watermark symbol period and represents the span of the watermark symbol in the audio 
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signal. Chart 183 shov^ the results of the signal illustrated in chart 181 once it has passed 

through the sample repeater 180. 

The window shaping function s[n]. which is a bi-phase fimction as shown in 
Fig. 2, is then applied to convert the rectangular pulse signals derived ftom and into 
slowly varying signals w^n] and W2M respectively. The wmdow shaping function is of 

width Ts. , , 

The generated signals wi[n] and W2[n] are then added up with a relative delay 

Tr (where Tr<T^. to give the multi-bit payload watermark signal Wc[n] i.e. 
' „^ (5) 

The value of TV is chosen such that the zero crossings of w/ match the 
maximum ampHtude points of and vice-versa. Thus, for this bi-phase wmdow sh^mg 
function Tr^W. For other window shaping functions, other values of are posable. 

As wUl be appreciated by the below description, during detection the 
correlation of Wc[n] will generate two correlation peaks that are separated hypL (as canbe 
15 seen inFig. 13). The value pL is part of the payload, and is defined as 

InadditiontopL,extrainformationcanbeencodedbychangingfherelativesi^ • ,v 

embedded watermarks. In the detector, this is seen as a relative sign between the 
correlation peaks. It wiU be seen that r.,^ can take four possible values, and may be defined 

20 as: 

.s.. =^^^^^-{0.1.2.3} <^ 

where pj-Bx&^cLO and /^signrda) are respectively estimates of the sign bits n (input 80) 
and r2 (input 90) of Fig. 9, and cl; and cL^ are the values of the correlation peak 
correspondmg to and respectively. The overall watmnark payload/^i.. for an error 
25 fiee detection, is then given as a combination of rsign and pL: 



pK-irs\^iPj)' 



(8) 
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The maximum infonnation in number of bits, that can be carried by a 
watermark sequence of length Ly, is thus given by: 



bits 



Fig. 1 1 shows a bloclc diagram of a watermark detector (200, 300, 400). The 
detector consists of three major stages: (a) the watermark symbol extraction stage (200), (b) 
the buffering and mterpolation stage (300), and (c) the correlation and decision stage (400). 

In the symbol extraction stage (200), the received watermarked signal y '[n] is 
processed to generate multiple (JV*) estimates of the watermarked sequence. These estimates 
of the watermark sequence are required to resolve a time offeet that may exist between the 
embedder and the detector, so that the watermark detector can synchronize to the watermark 
sequence inserted in the host signal. 

In the buffering and interpolation stage (300), these estimates are de- 
multiplexed into Nb separate bujffers, and an interpolation is applied to each buffer to resolve 
possible timescale modifications that may have occurred e.g. a drift in sampling (clock) 
fi-equency may have resulted in a stretch or shrink in the time domain signal (i.e. the 
watermark may have been stretched or shrunk). 

In the correlation and decision stage (400), the content of each buffer is 
correlated with the reference watermark and the maximum correlation peaks are compared 
against a threshold to determine the likelihood of whether the watermark is indeed embedded 
within the received signal y '[n ]. 

In order to maximize the accuracy of the watermark detection, the watermark 
detection process is typically carried out over a length of received signal y '[n] that is 3 to 4 
times that of the watermark sequence length. Thus each watermark symbol to be detected can 
be constructed by taking the average of several estimates of said symbol. This averaging 
process is referred to as smoothing, and the number of tunes the averaging is done is referred 
to as the smoothing factor Sf. Thus the detection window lengtih Lo is the length of the audio 
segment (in number of samples) over which a watermark detection truth-value is reported. 
Consequently, Ld^s/^T^, where T, is the symbol period andZ,, the number of symbols 
within the watermark sequence. Typically, the length {10 of each buffer 320 within the 
buffering and interpolation stage is Lb=SfLy,. 
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In the watemark symbol extraction stage 200 shown in Fig. 1 1, the incoming 



watennark signal y'/w7 is input to the signdconcUtioxiingfflterH^^ Thisfater210 is 
typically a band pass filter and has the same behavior as the corresponding filter (He 20) in 
the watermark embedder 10. The ou^ut of the filter Hb isy'h [n]^ and assimiing linearity 
5 within the transmission medium, it follows from equations (1) and (3): 



embedder and the detector is implicitly ignored. For ease of explanation of the general 
watermarking scheme principles, from now on, it is assumed that there is perfect 

10 synchronism between the embedder and the detector (i.e. no ofiFset). It should be noted 

however that if there is not perfect synchronism between the embedder and the detector, then 
the deviation can be compensated for within the bxiffering and interpolation stage 300 
utilizing techniques known to the skilled person e.g. iteratively searching through alternative 
shifts in scale and offset until a best match is achieved. 

15 Note that when no filter is used in the embedder (i.e., when H=l) then Hb in 

the detector can also be omitted, or it can still be included to improve the detection 
performance. If Hb is omitted, then yb in equation (10) is replaced with y. The rest of the 
processing is the same. 

Assuming that the audio signal is divided into frames of length and that y &m/n/ is the n- : 
20 th sample of liie m-th filtered firame signal, the energy E[m] corresponding to the jw-th firame 
is thus: 



where S[n] is the same window shaping fimction used in the watermark conditioning circuit 
of Fig. 10. A person skilled in the art will appreciate that equation 1 1 represents a matched 
25 filter receiver, and is the optimum receiver when the symbol period is perfectiy synchronized. 
Not withstanding this fact, from now on, we set iS/n7=/ in order to simplify subsequent 
explanations. 



(10) 



Note that in tiie above expression, the possible time offset between the 




(11) 
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Combining this with equation 10, it follows tiiat: 

E[m] « = Z|a +ccwAm])x,^[nf (12) 

where WefmJ is the m-th extracted watermark symbol and contains Nb time-multiplexed 
estimates of the embedded watermark sequences. Solving for We[m] in equation 12 and 
5 ignoring higher order terms of a, gives the following approximation: 




In the watermark extraction stage 200 shown in Fig. 1 1 , the output y'b[n] of 
the filter is provided as an input to a firame divider 220, which divides the audio signal 
into jframes of length Ts i.e. into y *b.m[n], with the energy calculating unit 230 then being used 
10 to calculate the energy corresponding to each of the framed signals as per equation (1 1). The 
output of this energy calculation unit 230 is then provided as an input to the whitening stage 
(240) which performs the function shown in equation 13 so as to provide an output WeE^]- 
It will be realized that the denominator of equation 13 contains a term that 
requires knowledge of the host (original) signal x. As the signal x is not available to the 
15 detector, it means that in order to calculate We[m] then the denominator of equation 13 must 
be estimated. 

Below is described how such an estimation can be achieved for the bi-phase 
window shaping function, but it will equally be appreciated that the teaching could be 
extended to olher window shaping functions. 

20 It will be seen by examination of the bi-phase window function shown in Fig. 

2, that when the audio envelope is modulated with such a window function;^ the first and the 
second halves of the firame are scaled in opposite directions. In the detector, this property is 
utilized to estimate the envelope energy of the host signal x. 

Consequently, within the detector, the audio firame is first sub-divided into two 

25 halves. The energy functions corresponding to the fixst and second half frames are hence 
given by 

Ei[m\^'^^Y^\ytA^f (14) 
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and 

ii=r,/2 



10 



respectively. As the envelope of the original audio is modulated in opposite directions vdtbin 
the two sub-ftames, the original audio envelope can be approximated as the mean of E,fmJ 
and E2[m]. 

Furdier, the instantaneous modulation value can be taken as the difference 
between these two fimctions. Thus, for the bi-phase window function, the watermark We[m] 
can be approximated by: 



, ^ 1 ( Ey[m\-Ei[m\ ^ 



(16) 



Consequently, the whitening filter 240 for a bi-phase vvindow shaping 
function can be realized as shown in Fig. 1 0. Inputs 242 and 243 respectively receive the 
energy fimctions of the first and second halve firames Ei[m] and E2M. Each energy fimction 
is then spUt up into two, and provided to adders 245 and 246 which respectively calculate 
1 5 E,[m] - E,[mh and Ei[m] + E^lmJ. Both of these calculated fimctions are then passed to the 
calculathxg unit 248 which divides the value fix,m adder 245 by the value firom 246 so as to 
calculate an estmiateforthevratermarkwe[m], in accordance with equation 16. 

This output ^.[m] is then passed to the buffering and interpolation stage 300, 
where the signal is de-multiplexed by the de-multiplexer 310. buffered m buffers 320 of- 
20 length U so as to resolve any lack of synchronism between the embedder and &e detector, 
and interpolated within the interpolation unit 330 so as to compensate for a possible time 
scale modification between the embedder and the detector. Such compensation can utilize 
known techniques, and hence is not described in any more detail within this specification. 

As shown in Fig. 1 1 , outputs (wdi, wd2, ... w W from the buffering stage are 
25 passed to the interpolation stage and, after interpolation, the outputs (wn, W12. ... w/W of this 
stage, which correspond to the different estunates of the conrectly re-scaled signal, are passed 
to thi correlation and decision stage. If it is beUeved that no time scaling compensation is 
required, the values (woi, ^^2. ... W can be passed directly to the correlation and decision 
stage 400 i.e. the interpolation stage 330 can be omitted firom the ^paratus. 
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The correlator 410 calculates the correlation of each estimate Wff,J=l, ...,Nb 
with respect to the reference watermark sequence Wc[k]. Each respective coirelation output 
corresponding to each estimate is then applied to the maximum detection unit 420 which 
determines which two estimates provided the maximum correlation peak values, and these 
estimates are chosen as the ones that best fit the circularly shifted versions Wdi and Wd2 of the 
reference watermark, and the correlation values for these estimate sequences are passed to the 
threshold detector and payload extractor unit 430, 

If the interpolation stage is omitted, alternatively the correlator 410 calculates 
the correlation of each estimate wdj.J^I. ....Nt with the reference watermark sequence WsfkJ 
and the results are passed on for subsequent processing to the units 420 and 430 as outlined 
in the above paragraph. 

The payload extractor unit 430 may be utilized to extract the payload (e.g. 
mformation content) from the detected watermark signal. Once the unit has estimated the two 
correlation peaks cLi and CL2 that exceed the detection threshold, the distance pL between 
the peaks (as defined by equation (6)) is measured. Next, the signs pj and pj of the 
correlation peaks are determined, and hence Tsign calculated from equation (7). The overaU 
watermark payload may then be calculated using equation (8). 

For instance, it can be seen in Fig. 13 that pL is the relative distance between 
the two peaks. Both peaks are positive i.e. pi = +1, and yc?? = +1. From equation (7), rsig„ = 3. 
Consequently, the payload pLw f = <3, pL>. 

The reference watermark sequence Ws used within the detector corresponds to 
(a possibly circularly shifted version of) the original watermark sequence appKed to the host 
signal. For instance, if Ihe watermark signal was calculated usmg a random number generator 
with seed S within the embedder, then equally the detector can calculate the same random 
number sequence using the same random number generation algorithm and the same initial 
seed so as to determine the watermark signal. Alternatively, the watermark signal originally 
applied in the embedder and utilized by the detector as a reference could simply be any 
predetermined sequence. 

Fig. 13 shows a typical shape of a correlation fimction as output from the 
correlator 410. The horizontal scale shows the correlation delay (in terms of the sequence 
bins). The vertical scale on the left-hand side (referred to as the confidence level cL) 
represents the value of the correlation peak normalized with respect to the standard deviation 
of the typically normally distributed correlation fimction. 
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As can be seen, the typical correlation is relatively flat with respect to cL, and 
centered about cL = 0. However, the ftinction contains two peaks, which are separated hypL 
(see equation 6) and extend upwards to cL values that are above the detection threshold when 

a watermark is present. 

A horizontal line (shown in the Fig. as bemg set at cL = 8.7)represents the 
detection threshold. The detection threshold value controls the false alarm rate. 

Two kinds of false alarms exist: The false positive rate, defined as the 
probability of detectmg a watermark in non watermarked items, and the false negative rate, 
which is defined as the probability of not detectmg a watermark in watermarked items. 
Generally, the requirement of the felse positive alarm is more stringent than that of the false 
negative. The right hand side scale on Fig. U illustiatestheprobabiHty of a false positive 
alarms As can be seen, in the example shown, Ihe probability of a felse positive p=i(r'^ is 
equivalent to the threshold cL = 8.7, whilstp = lO^' is equivalent to cL = 20. 

After each detection interval, the detector determines whether Ihe original 
watermark is present or whether it is not present, and on this basis output a ''yes" or a "no" 
decision. If desired, to improve Ihis decision making process, a number of detection windows , 
may be considered. In such an instance, the false positive probabiUty is a combination of the 
individual probabiUties for each detection window considered, dependent upon the deshed 
criteria. For mstance, it could be determined that if the correlation function has two peaks 
above a threshold of = 7 on any two out of three detection intervals, then the watermark is :.. 
deemed to be present Obviously, such detection criteria can be altered depending upon the 
desired use of the watermark signal and to take into account fectors such as the original 
quality of the host signal and how badly the signal is likely to be cormpted during normal 
transmission. 

It will be appreciated by Ihe skilled person that various implementations not 
specifically described would be understood as felling withm the scope of the present 
invention. 

For instance, whilst the implementation of a particular bi-phase window 
shaping fijnction has been described, and in particular a bi-phase window shapmg fimction m 
which each lobe is a Hamiing function, it wUl be appreciated that the present mvention is 
applicable to any wmdow shaping function falling within the scope of the appended claims. 
The observed reduction in the DC component of the firequency spectrum has been determined 
to be related to having a window shaping function m which the integral over the function is 
zero i.e. the total positive area is equal to the total negative area. Use of such a function 
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reduces the DC component of the fiequency spectrum irrespective of the watermark 
sequence. As useful information is not carried within the DC component, but only within the 
non DC components of the signal, any reduction in the DC component is desirable. 

Fig. 14 shows an example of an alternative window shaping function that 
would still fell withm the scope of the present invention. The function has four lobes. The 
lobes between adjacent zero-crossing points are Hanning window functions. It will be 
appreciated that such window shaping functions can be symmetric or anti symmetric. 

Whilst only the functionality of the embedding and detecting apparatus has 
been described, it will be appreciated that the apparatus could be realized as a digital circuit, 
an analog circuit, a computer program, or a combination thereof. 

Equally, whilst the above embodunent has been described with reference to an 
audio signal, it will be appreciated that the present invention can be appUed to other types of 
signal, for instance video and data signals. 

Within the specification it will be appreciated that the word "comprising" does 
not exclude other elements or steps, that «a" or "and" does exclude a plurality, and that a 
single processor or other unit may fulfil the functions of several means recited in the claims. 
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1 . A method of generating a watermark signal for embedding in a multimedia 
host signal, the method comprising the steps of: taking a first sequence of values; applying a 
window shaping function to said sequence of values so as to form a smoothly varying signal 
suitable for embedding in the host signal; wherein the integral over the window shaping 

5 function is zero. 

2. A method as claimed in claim 1, wherein the window shaping function has an 
anti-symmetric temporal behavior, 

10 3. A method as claimed in claim 1 , wherein the window shaping function has a 

bi-phase behavior. 

4. A method as claimed in claim 3, wherein the bi-phase window comprises at 

least two Manning windows of opposite polarities. 



5. A method as claimed in claim 1 , wherein the frequency spectrum of the 

smoothly varying signal has a DC component less than a component of any non-DC peak 
within the frequency spectrum. 



20 6. A method as claimed in claim 1 , wherein each value of the first sequence is 

represented by a pulse train of width Ts so as to form a rectangular wave signal, the window 
shaping function also being of width Ts. 

7. A method as claimed in claim 1, wherein said first sequence of values is 

25 convolved with the window shaping function so as to form said smoothly varying signal. 



8. A method as claimed in claim 1, the method further comprising the step of 

embedding said smoothly varying signal into the host signal. 
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9. An apparatus arranged to generate a watermark signal suitable for embedding 
in a host multimedia signal, the apparatus comprising: a) a signal generator arranged to 
generate a watemiark signal by taking a first sequence of values; and b) processing means 
arranged to apply a window shaping function to said sequence of values so as to form a 
smoothly varying signal suitable for embedding in a host signal; wherein the integral over the 
window shaping function is zero. 

1 0. An apparatus as claimed in claim 9, wherein the apparatus further comprises a 
watemiark embedding apparatus that embeds said smoothly varying signal into the host 
signal. 

11. A multimedia signal comprising a watennark, wherein the original multimedia 
signal has been watemiarked by a smoothly varying signal foimed by applying a window 
shaping function to a sequence of values, the integral over the window shaping function 
being zero. 

12. A signal as claimed in claim 1 1, wherein fhe temporal envelope of the original 
signal has been modified by the watermark. 

13. A method of detecting a watermark signal embedded in a multimedia signal, 
the method comprising the steps of: 

(a) receiving a multimedia signal that may potentially be watermarked by a 
watennark signal modifying the host multimedia signal; 

(b) extracting an estimate of the watennark fi-om said received signal by assuming 
that the watermark comprises a sequence of values to which a window shaping function has 
been applied, the integral over the window shaping fiinction being zero; and 

(c) processing the estimate of the watermark with a referenced version of the 
watennark so as to determine whether the received signal is watermarked. 

14. A method as claimed in claim 13, the method further comprising the step of 

applying a window shaping function to said received signal, tiie integral over tiie window 
shaping function being zero. 
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15. A method as claimed in claiml3, wherein the wateraiark signal has a payload, 

and the method further comprises the step of determining the payload of the watermark. 




16. A watermark detector apparatus arranged to detect whether a watermark signal 

5 is embedded within a multimedia signal, the watermark detector comprising: 

(a) a receiver arranged to receive a multhnedia signal that may potentially be 
watermarked by a watermark signal modifying the host multimedia signal; 

(b) an extractor arranged to extract an estimate of the watermark fix)m said 
received signal by assuming that the watermark comprises a sequence of values to which a 

1 0 window shaping function has been applied, the integral over the window shaping function 
being zero; and 

(c) a processor arranged to process the estimate of the watermark with a 
referenced version of the watermark so as to determine whether the received signal is 
watermarked. 



17. An apparatus as claimed in claim 1 6, wherein the apparatus further comprises 

a unit arranged to apply a window shaping function to the said received signal, wherein the 
integral over the window shaping function is zero. 
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ABSHIACT: 28. 03. 2002 



A window shaping function is described which has an integral over the 
window shaping function of zero. Compared with conventional window shaping functions, 
Ihis window shaping function improves the robustness of the watermark signal for a given 
quality of host signal. Methods and apparatus suitable for utilizing this window shaping 
function within a watermarking scheme are described. 
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